Ref 

# 


Hits 


Search Query 


DBS 


Default 
Operator 


Plurals 


Time Stamp 


L4 


7649 


heterologous adj sequence 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 07:53 


L5 


83 


L4 and gip 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 07:53 


L6 


12 


L5 and preproglu$ 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 07:55 


L7 


229 


preproglu$ 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 07:55 


L8 


S3 


17 and @py<"2001" 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 07:56 


L9 


39 


17 and @py<"2000" 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 07:56 


LIO 


18 


17 and @py<"1998" 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 08:55 


Lll 


6 


"6110707" and linker 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 08:55 


L12 


10 


"6110707" 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 08:56 


L13 


3140 


"6110707" gIp 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 08:56 


L14 


3 


"6110707" and gip 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 08:59 


LIS 


2 


114 and fusion adj protein 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 09:14 


' L16 


0 


connolley adj s 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 09:14 


L17 


0 


connolley and gIp 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 09:14 


L18 


0 


connelley and gIp 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 09:15 
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L19 


12 


Connelly and gip 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 09:19 


L20 


6 


119 and fusion adj protein 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 09:21 


L21 


2 


"6723530" 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 09:21 


L22 


1 


121 and gIp 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 10:10 


L23 

1 


326 


prepro$ and gIp 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 10:11 


L24 


37 


123 and @py<"2000" 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 10:11 


L25 


10 


124 and heterologous 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 10:14 


L26 


599 


gIp and fusion adj protein 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 10:15 


L27 


40 


126 and @py<"2000" 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 10:15 


L28 


13 


126 and @py<"1998" 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 10:20 


L29 


72 


126 and @py>"1998" and 
@py<"2002" 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 10:21 


1 L30 


15 


126 and @py>"1998" and 
@py<"2000" 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 11:16 


L31 


18 


wadsworth adj samuel 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 11:16 


L32 


20 


armentano adj donna 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 11:16 


L33 


120 


gregory adj ricliard 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 11:17 


L34 


3 


parsons adj geoffrey 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 11:17 



Search HIstor/ 1/27/06 11:22:12 AM Page 2 

C:\Documents and SetUngs\GChandra\My Documents\EAST\Workspaces\10794897.wsp 



L35 


137 


131 or 132 or 133 or 134 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 11:17 


G6 


3 


135 and gip 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


ON 


2006/01/27 11:18 
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GenCore version 5.1.6 
Copyright (c) 1993 - 2006 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



January 3, 2006, 23:27:35 ; Search time 229 Seconds 

(without alignments) 
240.311 Million cell updates/sec 



Title: 

Perfect score: 
Sequence ; 



US-10-716-326-4 
402 

1 MKIILWLCVFGLFIiATLFPI. 



. SYLEGQAAKEFIAWLVKGRG 78 



Scoring table: 



BLOSUM62 
Gapop 10.0 



Gapext 0.5 



Searched: 



2166443 seqs, 705528306 residues 



Total number of hits satisfying chosen parameters; 



2166443 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : 



UniProt_05.80:* 

1 : uniprot_sprot : * 
2: uniprot_trembl : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 


1 


324 


80.6 


87 


1 


EXE4_HELSU 


P26349 


heloderma s 


2 


314 


78.1 


87 


2 


Q7SZU6 HELHR 


Q7szu6 


heloderma h 


3 


168.5 


41.9 


176 


1 


GLUC SHEEP 


Q8mj25 


o glucagon 


4 


168.5 


41.9 


180 


1 


GLUC_BOVIN 


P01272 


b glucagon 


5 


168. 5 


41.9 


180 


1 


GLUC_CANFA 


P29794 


c glucagon 


6 


168.5 


41.9 


180 


1 


GLUC_PIG 


P01274 


s glucagon 


7 


167.5 


41.7 


180 


1 


GLUC_CAVPO 


P05110 


c glucagon 


8 


167.5 


41.7 


180 


1 


GLUC_HUMAN 


P01275 


h glucagon 


9 


167.5 


41.7 


180 


1 


GLUC MESAU 


P01273 


m glucagon 


10 


167.5 


41.7 


180 


1 


GLUC MOUSE 


P55095 


m glucagon 


11 


167.5 


41.7 


180 


1 


GLUC_RAT 


P06883 


r glucagon 


12 


167.5 


41.7 


180 


2 


Q53TP6 HUMAN 


Q53tp6 


homo sapien 


13 


167 


41.5 


180 


1 


GLUC OCTDE 


P22890 


o glucagon 


14 


166.5 


41.4 


45 


2 


Q6PPF4_CAPHI 


Q6ppf 4 


capra hircu 


15 


164.5 


40.9 


206 


1 


GLUC_CHICK 


P68259 


g glucagon 



16 


154 . 5 


38. 


,4 


124 


2 


Q6RYB1 9SAUR 


Q6rybl 


agkistrodon 


17 


153. 5 


38. 


2 


120 


2 


Q6RYB7 ICTPU 


Q6ryb7 


ictalurus p 


18 


151. 5 


37. 


7 


204 


1 


GLUC HELSU 


012956 


h glucagon 


19 


149 


37. 


1 


145 


2 


Q6RYB5 NEOFS 


Q6ryb5 


neoceratodu 


20 


148 


36. 


8 


153 


2 


Q6RYB6 PRODO 


Q6ryb6 


protopterus 


21 


146. 5 


36. 


,4 


122 


1 


GLUC2 LOPAM 


P04092 


lophius ame 


22 


144 


35.8 


80 


2 


Q6IUP8 PHOSU 


Q6iup8 


phodopus su 


23 


140 


34 . 


8 


123 


2 


Q6RYA9"9PERC 


Q6rya9 


sebastes ca 


24 


139 


34 . 


. 6 


860 


2 


Q4RQJ4 TETNG 


04 rai 4 


tetraodon n 


25 


138 


34 . 


3 


220 


2 


Q8UWL9~9NEOB 


Q8uwl9 


hoplobatrac 


26 


134 


33. 


3 


149 


2 


Q6RYB2~BUFMA 


Q6ryb2 


bufo marinu 


27 


130 


32 . 


, 3 


266 


1 


GLUCl XENLA 


042143 


xenopus lae 


28 


130 


32 , 


, 3 


266 


2 


Q6DIZ4^ XENTR 


Q6diz4 


xenopus tro 


29 


128 


31 , 


, 8 


121 


2 


Q5PR39 BRARE 


Q5pr39 


brachydanio 


30 


127 


31 . 


. 6 


103 


1 


GLUC RANCA 


P15438 


rana catesb 


31 


125 


31 . 


1 


121 


2 


Q6RYC1 9PERC 


Q6rycl 


sebastes ca 


32 


125 


31 . 


, 1 


176 


2 


Q6RYC2''9PERC 


Q6ryc2 


sebastes ca 


33 


125 


31. 


. 1 


219 


1 


GLUC2 XENLA 


042144 


xenopus lae 


34 


125 


31, 


.1 


219 


2 


Q5D082 XENLA 


Q5d082 


xenopus lae 


35 


124 


30, 


.8 


66 


2 


Q788W6~ONCTS 


Q788w6 


oncorhynchu 


36 


124 


30, 


.8 


72 


2 


Q91409~ONCTS 


Q91409 


oncorhynchu 


37 


124 


30, 


.8 


122 


2 


Q6RYB8~ICTPU 


Q6ryb8 


ictalurus p 


38 


124 


30. 


.8 


173 


2 


Q6RYB9~ICTPU 


Q6ryb9 


ictalurus p 


39 


124 


30. 


.8 


178 


1 


GLUCl ONCMY 


Q91971 


oncorhynchu 


40 


122.5 


30. 


.5 


176 


2 


Q6RYB0 9PERC 


Q6ryb0 


sebastes ca 


41 


122 


30. 


.3 


30 


1 


GLUCL ANGAN 


P63294 


anguilla an 


42 


122 


30. 


.3 


30 


1 


GLUCL^ANGRO 


P63295 


anguilla ro 


43 


120 


29. 


.9 


121 


1 


GLUC^CARAU 


P79695 


carassius a 


44 


120 


29, 


.9 


124 


2 


Q4S308 TETNG 


Q4s308 


tetraodon n 


45 


119 


29, 


.6 


160 


1 


GLUC1_PETMA 


Q9purl 


petromyzon 



ALIGNMENTS 



RESULT 1 
EXE4_HELSU 

ID EXE4_HELSU STANDARD; PRT; 87 AA, 

AC P26349; 

DT Ol-MAY-1992 (Rel. 22, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT . lO-MAY-2005 (Rel. 47, Last annotation update) 

DE Exendin-4 precursor. 

OS Heloderma suspectum (Gila monster) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Lepidosauria; Squamata; Scleroglossa; Anguimorpha; Helodermatidae; 

OC Heloderma . 

OX NCBI_TaxID=8554 ; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RX MEDLINE=97172477; PubMed=9020121; DOI=10 . 1074/ jbc . 272 . 7 . 4335 ; 

RA Chen Y.E., Drucker D.J.; 

RT "Tissue-specific expression of unique mRNAs that encode proglucagon- 

RT derived peptides or exendin 4 in the lizard."; 

RL J. Biol. Chem. 272:4108-4115(1997). 

RN [2] 

RP PROTEIN SEQUENCE OF 48-86. 



RC TISSUE=Venom; 

RX MEDLINE=92218391; PubMed=1313797 ; 

RA Eng J., Kleinman Singh L., Singh G., Raufman J. -P.; 

RT "Isolation and characterization of exendin-4, an exendin-3 analogue, 

RT from Heloderma suspectum venom. Further evidence for an exendin 

RT receptor on dispersed acini from guinea pig pancreas."; 

RL J. Biol. Chem. 267:7402-7405(1992). 

RN [3] 

RP STRUCTURE BY .NMR OF 4 8-8 6. 

RX PubMed=11683627; DOI=10 . 1021/bi010902s ; 

RA Neidigh J.W., Fesinmeyer R.M. , Prickett K.S., Andersen N.H.; 

RT "Exendin-4 and glucagon-like-peptide-1 : NMR structural comparisons in 

RT the solution and micelle-associated states."; 

RL Biochemistry 40 : 13188-13200 (2001 ) . 

CC -!- FUNCTION: Has a VIP/secretin-like biological activity. Interacts 
CC with the exendin receptor. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -I- TISSUE SPECIFICITY: Expressed by the venom gland. 

CC -!- SIMILARITY: Belongs to the glucagon family. 

CC 

CC This Swiss-Prot entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use as long as its content is in no way modified and this statement is not 

CC removed. 

CC 

DR EMBL; U77613; AAB51130.1; -; mRNA. 

DR PIR; A42486; HWGH4G. 

DR PDB; IJRJ; NMR; A=48-86. 

DR InterPro; IPR000532; Glucagon. 

DR Pfam; PF00123; Hormone_2; 1. 

DR SMART; SM00070; GLUCA; 1. 

DR PROSITE; PS00260; GLUCAGON; 1. 

KW 3D-structure; Amidation; Direct protein sequencing; Glucagon family; 

KW Signal; Toxin. 



FT SIGNAL 


1 


23 




Potential . 


FT PROPEP 


24 


47 






FT PEPTIDE 


48 


86 




Exendin-4 . 


FT MOD__RES 


86 


86 




Serine amide (G-87 provides amide 


FT TURN 


52 


53 






FT HELIX 


54 


74 






FT TURN 


75 


76 






FT HELIX 


77 


79 






SQ SEQUENCE 


87 AA; 


9479 


MW; 


656BA6E3D87454A2 CRC64; 


Query Match 




80 


.6%; 


Score 324; DB 1; Length 87; 


Best Local 


Similarity 


81 


.6%; 


Pred. No. 5.4e-28; 



Matches 62; Conservative 



5; Mismatches 



9; Indels 



0; Gaps 



0; 



Qy 

Db 

Qy 

Db 



1 MKIILWLCVFGLFLATLFPISWQMPVESGLSSEDSASSESFAKRIKRHGEGTFTSDVSSY 60 

1 I I i I M I I M I I I M I I I I I I I I I M I I I I I I M I I I I I I I : I I I M I I I I I I I : I 
1 MKIILWLCVFGLFLATLFPISWQMPVESGLSSEDSASSESFASKIKRHGEGTFTSDLSKQ 60 

61 LEGQAAKEFIAWLVKG 76 

: I : I : I I I I I 
61 MEEEAVRLFIEWLKNG 76 



GenCore version 5.1.6 
Copyright (c) 1993 - 2006 Compugen Ltd- 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



January 3, 2006, 23:17:55 ; Search time 185 Seconds 

(without alignments) 

185.252 Million cell updates/sec 

US-10-716-326-4 
402 

1 MKIILWLCVFGLFLATLFPI SYLEGQAAKEFIAWLVKGRG 78 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 2443163 seqs, 439378781 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



2443163 



Database 



A_Geneseq_21 : * 

1: geneseqpl980s 

2: geneseqpl990s 

3: geneseqp2000s 

4: geneseqp2001s 

5: geneseqp2002s 

6: geneseqp2003a 

7: geneseqp2003b 

8: geneseqp2004s 

9: geneseqp2005s 



★ 
★ 
★ 
★ 
* 

* 
* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


402 


100.0 


78 


6 


ABP98121 


Abp98121 Amino aci 


2 


402 


100.0 


78 


9 


ADZ80126 


Ad280126 Exendin-4 


3 


364 


90.5 


80 


6 


ABP98129 


Abp98129 Amino aci 


4 


364 


90.5 


80 


9 


ADZ80142 


Adz80142 Exendin-4 


5 


325.5 


81.0 


77 


6 


ABP98122 


Abp98122 Amino aci 


6 


325.5 


81. 0 


77 


9 


ADZ80128 


Adz80128 Helodermi 


7 


324 


80.6 


87 


2 


AAW70288 


Aaw70288 Heloderma 


8 


324 


80.6 


87 


7 


ADF15336 


Adfl5336 Human alb 



9 


324 


80 . 


6 


87 


7 


ADF15337 


Adf 15337 


iiu-iiiciii dxjkj 


10 


324 


80 . 


6 


87 


7 


ADH21405 


Adh21405 


Human ^vt* 


11 


324 


80 . 


6 


87 


7 


AnH21406 

■C\JJl 1 ^ X 1 V w 


Adh21406 


Human ^vf" 


12 


231 


57 . 


5 


47 


5 


ARUQ1 Q76 


Abu91976 


f^l A a TTi^^n c 


i. -J 


198 


49 . 


3 


64 


8 


AnT.Q91 S3 


Adl92153 




14 


195 


48 . 


5 


84 




ABP98124 


AbD98124 


Ami no a r* i 


15 


195 


48 . 


5 


84 


9 


Anz80132 


Adz80132 


IGF-l ( fur 

X W 1> X \ X U X 


16 


192 


47 . 


8 


82 


5 


ABP98123 

t ^ U X ^ -J 


AbD98123 


Amino aci 


1 7 


192 


47 . 


8 


82 


9 


An7. ft 01 30 


AdzSOl 30 


C^TP GTiP-l 

\3 A. U t \3 AJ U X 


18 


184 . 5 


45 . 


9 


77 


5 


ABP98 128 


AbD98128 


Amino aci 


1 Q 


X O *1 • «J 


45 


Q 


77 


9 


An?. 8 01 4 0 


Adz8014 0 




20 


178 


44 . 


3 


386 


4 


AAR30703 


Aab30703 


A Ra 1 111 

IX LJ a s^A. A. A. \A 


21 


176 . 5 


43 . 


9 


77 


7 


ADFI 6689 


Adf 1668 9 


Hi iTTia n a 1 V) 




1 7fi S 


4 ? 


Q 


77 


7 


AnH9 1 ft ft 9 


AHh91 ftfiQ 


Human fiT.P 

liLUUClli \JAJU 


^ o 


1 7fi S 


4 

*i -J . 


Q 




7 


AHFI 6596 


Adf 1 6596 


HiiTTii^n alV^ 
nuiiiciii cixj^y 




176 5 


43 


Q 


662 


7 


AnH21 814 


Adh21814 


Hiima n a 1 Vi 




176 


4 ? 


ft 


V ** 


7 


ADFI 64 69 


Adf 16462 


Human a 1 Vi 

ilUllllClll CkA,t>J 


26 


176 


43 . 


8 


654 


7 


ADH21777 


Adh21777 


Human alb 


91 


172 S 


49 


9 


89 


7 


ADFI 61 95 


Adf 16195 

r^sji X X v X ^ x.^ 


Huma n a 1 h 


0 ft 


179 ^ 


4 9 


Q 

-7 


ft Q 


7 


AnH91 6S9 


Adh91 6S9 


Human riT.P 




179 S 


4 9 


Q 




7 


ADFI 704 4 


Adf 17044 


Huma n a 1 h 




179 S 


49 


Q 


673 


7 


AnH99037 


Adh22037 


Mou ^ ^ a 1 h 


•J J. 


179 S 


42 . 


9 


574 


7 


ADFI 6193 

f\LJ L X V,/ X ^ <J 


Adf 16193 


Human alb 




179 S 


4 9 


q 


fi7 4 


7 


AnH91 650 


Adh21650 


Huma n alb 


o o 


1 79 ^ 


4 9 


q 


fi74 


q 


AnW4 ^909 


Adw4 5202 


K 1 art i s 

1 V . A. CL\^ \^ A. 




179 ^ 


4 9 


q 


Q1 S 

Z7 X 


q 


AnW4 5904 


Adw4 52 04 


K lactis 

1\ . X O ^ w X i9 




179 


4 9 


Q 

o 


U O -J 


7 


ADFI 651 9 


Adf 16512 

J^SJ^ A- A- \J -J A. 


Human alb 




179 


42 . 


8 


663 


7 


ADH9 18 03 


Adh21803 


Human alb 


o / 


171 S 


4 9 


7 


fin 


7 


ADFI 7 04 8 


Adf 17048 


Human alb 

liuiiiuii cixiii^ 


o o 


171 ^ 

X / X * ^ 


49 


7 


60 


7 


ADH92038 


Adh22038 


Human GLP 

ituiiiaii y^jj^ 




171 R 


4 9 


7 


ft 


7 


AHFI 6fift7 


Adf 1 6687 

X X U V (J r 


Hnma n alb 

iiuiiiciii a X 


40 


171.5 


42. 


1 


83 


1 


ADH21887 


Adh21887 


Human GLP 


41 


171.5 


42. 


1 


145 


1 


ADF16688 


Adfl6688 


Human alb 


42 


171.5 


42. 


1 


145 


1 


ADH21888 


Adh21888 


Human GLP 


43 


171.5 


42. 


1 


664 


7 


ADF16510 


Adfl6510 


Human alb 


44 


171.5 


42. 


1 


664 


7 


ADH21801 


Adh21801 


Human alb 


45 


171.5 


42. 


1 


668 


7 


ADF16524 


Adfl6524 


Human alb 



ALIGNMENTS 



RESULT 1 

ABP98121 

ID ABP98121 standard; protein; 78 AA. 
XX 

AC ABP98121; 

XX 

DT ll-AUG-2003 (first entry) 
XX 

DE Amino acid sequence of proexendin leader linked to modified GLP-1. 
XX 

KW Glucagon-like peptide 1; GLP-1; blood sugar disorder; diabetes; 

KW hyperglycemia; hypoglycaemia; hypoinsulinisra; insulin production; 

KW proexendin. 
XX 

OS Synthetic. 



OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT Peptide 1. .47 

FT /note= "proexendin leader" 
XX 

PN WO2003014318-A2. 
XX 

PD 20-FEB-2003. 
XX 

PF 07-AUG-2002; 2002WO-US025227 . 
XX 

PR 08-AUG-2001; 2001US-0310982P . 
XX 

PA (GENZ ) GENZYME CORP. 
XX 

PI Wadsworth.^SC, Armentano D, Gregory RJ, Parsons G; 
XX 

DR WPI; 2003-256570/25. 

DR N-PSDB; ACC43465. 
XX 

PT New glucagon-like peptide 1 (GLP-1) precursor comprising a mammalian GLP- 

PT 1 linked to a heterologous signal sequence, useful for treating blood 

PT sugar disorders, e.g. Type I and Type II diabetes, hyperglycemia, or 

PT hypoglycemia . 
XX 

PS Claim 16; Fig 2; 69pp; English. 
XX 

CC The present sequence is a proexendin leader linked to a Gly-8 modified 

CC human glucagon-like peptidfe^-i'^TtSI^-l) . The specification describes 

CC compositions, expression vectors and host cells comprising nucleic acid 

CC which encodes a precursor GLP-1 linked to a heterologous signal sequence. 

CC Nucleic acids encoding GLP-1 may be used to treat blood sugar disorders 

CC such as Type I and Type II diabetes, hyperglycemia, hypoglycaemia, and 

CC hypoinsulinism and to promote or stimulate insulin production 
XX 

SQ Sequence 78 AA; 



Query Match 100.0%; Score 402; DB 6; Length 78; 

Best Local Similarity 100.0%; Pred. No. 2.4e-40; 

Matches 7'8'7"^ Conservative 0; Mismatches 0; Indels 0; Gaps 0 

/) 

Qy 1 MKIILWLCVFGLFLATLFPISWQMPVESGLSSEDSASSESFAKRIKRHGEGTFTSDVSSY 60 

I I I I I I I I I Ml I I I I I I I I I I I I I I I M I I I I I I I I I I I Ml I I I I I I I I I I I i I I I I I 
Db ---1 MKIILWLCVFGLFLATLFPISWQMPVESGLSSEDSASSESFAKRIKRHGEGTFTSDVSSY 60 

Qy 61 LEGQAAKEFIAWLVKGRG 78 — . 

M M M M M M M M M 
Db 61 LEGQAAKEFIAWLVKGRG 78 



RESULT 2 
ADZ80126 

ID ADZ80126 standard; protein; 78 AA. 
XX 

.AC ADZ80126; 
XX 



DT 28-JUL-2005 (first entry) 
XX 

DE Exendin-4 .GLP-lGly8 amino acid sequence SEQ ID NO: 4. 
XX 

KW glucagon-like peptide 1; fusion protein; antidiabetic; hepatotropic; 

KW metabolic; insulin dependent diabetes; non-insulin dependent diabetes; 

KW metabolic disorder; liver disease; exendin. 
XX 

OS Mammalia. 

OS Synthetic. 
XX 

FH Key Location/Qualifiers 

FT Peptide 1. .47 

FT /note= "Proexendin signal peptide" 

FT Protein 48. .78 

FT /note= "GLP-lGly8" 

XX 

PN US2005107318-A1. 
XX 

PD 19-MAY-2005. 
XX 

PF 17-NOV-2003; 2003US-00715976 . 
XX 

PR 17-NOV-2003; 2003US-007 15976 . 
XX 

PA (WADS/) WADSWORTH S. 

PA (ARME/) ARMENTANO D. 

PA (GREG/) GREGORY R J. 

PA (PARS/) PARSONS G. 
XX 

PI Wadsworth S, Armentano D, Gregory RJ, Parsons G; 
XX 

DR WPI; 2005-354819/36. 

DR N-PSDB; ADZ80125. 
XX 

PT Treating diabetes comprises administering a nucleic acid encoding a 

PT precursor Glucagon-like peptide 1 (GLP-1) comprising mammalian GLP-1 

PT linked to a heterologous signal sequence. 
XX 

PS Example 1; SEQ ID NO 4; 70pp; English. 
XX 

CO The invention relates to a method which comprises administering to an 

CC individual a nucleic acid encoding a precursor glucagon-like peptide 1 

CC (GLP-1) comprising mammalian GLP-1 linked to a heterologous signal 

CC sequence, where the precursor GLP-1 is cleaved in vivo or ex vivo which 

CC results in generation of activated GLP-1 in the individual, where the 

CC activated GLP-1 reduces plasma triglyceride levels or lipid accumulation 

CC in an organ. Also described: (1) nucleic acids which encode GLP-1; (2) 

CC vectors comprising the nucleic acids; and (3) host cells comprising the 

CC nucleic acid. The method is useful for treating Type I diabetes (insulin 

CC dependent diabetes) and Type II diabetes (non-insulin dependent 

CC diabetes), metabolic syndrome and non-alcoholic fatty liver disease. The 

CC present sequence represents the exendin-4 . GLP-lGly8 GLP-1 fusion protein, 

CC which is used in an example from the present invention. 

XX 

SQ Sequence 78 AA; 



Query Match 100.0%; Score 402; DB 9; Length 78; 

Best Local Similarity 100.0%; Pred. No. 2.4e-40; 

Matches 78; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy 1 MKIILWLCVFGLFLATLFPISWQMPVESGLSSEDSASSESFAKRIKRHGEGTFTSDVSSY 60 

I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I 
Db 1 MKIILWLCVFGLFLATLFPISWQMPVESGLSSEDSASSESFAKRIKRHGEGTFTSDVSSY 60 

Qy 61 LEGQAAKEFIAWLVKGRG 78 

I I I I I I I I I I I I I I I I I I 
Db 61 LEGQAAKEFIAWLVKGRG 78 



RESULT 3 
ABP98129 

ID ABP98129 standard; protein; 80 AA. 
XX 

AC ABP98129; 
XX 

DT ll-AUG-2003 (first entry) 

XX 

DE Amino acid sequence of proexendin leader linked to modified GLP-1. 

XX 

KW Glucagon-like peptide 1; GLP-1; blood sugar disorder; diabetes; 

FCW hyperglycemia; hypoglycaemia; hypoinsulinism; insulin production; 

KW proexendin. 
XX 

OS Synthetic. 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT Peptide 1. .40 

FT /note= "proexendin leader" 
XX 

PN WO2003014318-A2. 

XX 

PD 20-FEB-2003. 
XX 

PF 07-AUG-2002; 2002WO-US025227 . 

XX 

PR 08-AUG-2001; 2001US-0310982P . 
XX 

PA (GENZ ) GENZYME CORP. 
XX 

PI Wadsworth SC, Armentano D, Gregory RJ^ Parsons G; 
XX 

OR WPI; 2003-256570/25. 

OR N-PSDB; ACC43473. 
XX 

PT New glucagon-like peptide 1 (GLP-1) precursor comprising a mammalian GLP- 

PT 1 linked to a heterologous signal sequence, useful for treating blood 

PT sugar disorders, e.g. Type I and Type II diabetes, hyperglycemia, or 

PT hypoglycemia. 
XX 

PS Claim 16; Fig 10; 69pp; English. 

XX 

CC The present sequence represents a proexendin leader linked to a Gly-8 



GenCore version 5.1.6 
Copyright (c) 1993 - 2006 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: January 5, 2006, 00:00:23 ; Search time 169 Seconds 

(without alignments) 
820.413 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-10-716-326-4 
402 

1 MKIILWLCVFGLFLATLFPI SYLEGQAAKEFIAWLVKGRG 78 



Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7 . 0 

Delop 6.0 , Delext 7.0 



Searched: 



1303057 seqs, 888780828 residues 



Total number of hits satisfying chosen .parameters : 2606114 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-M0DEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool/USl0716326/runat_03012006_143904_8663/app_query. fasta_ 
3 

-DB=Issued_Patents__NA -QFMT=fastap -SUFFIX=p2n. rni -MINMATCH=0 . 1 -LOOPCL=0 

-LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi 
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-MODE=LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-USER=US10716326_@CGN_l_l_193_@runat_03012006_143904_8 663 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 
-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0 . 5 -DEL0P=6 -DELEXT=7 



Database 



/cgn2_6/ptodata/l/ina/l_COMB. seq: * 
/cgn2_6/ptodata/l/ina/5_COMB. seq: * 
/cgn2_6/ptodata/l/ina/6A_COMB. seq: * 
/cgn2_6/ptodata/l/ina/6B_COMB. seq: * 
/cgn2_6/ptodata/l/ina/H_COMB. seq: * 
/cgn2_6/ptodata/l/ina/PCTUS_COMB.seq: * 
/cgn2_6/ptodata/l/ina/PP_COMB. seq: * 
/cgn2_6/ptodata/l/ina/RE_COMB. seq:* 
/cgn2_6/ptodata/l/ina/backf ilesl . seq: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 





No. 


Score 


Match 


Length 


DB 


ID 


Description 




1 


324 


80. 


6 


492 


3 


US-09-019-172-1 


Sequence 


1/ Appli 




2 


167 . 5 


41. 


7 


895 


3 


Us-08-7 84-582-55 


Sequence 


55, Appl 




3 


167 . 5 


41. 


7 


955 


3 


US-08-784-582-57 


Sequence 


57, Appl 




4 


167 . 5 


41 . 


7 


955 


3 


US-08-784-582-60 


Sequence 


60, Appl 




5 


167 . 5 


41. 


7 


1034 


3 


US-09-635-679E-1 


Sequence 


1, Appli 




6 


167.5 


41. 


7 


1062 


3 


US- 09- 01 6- 4 34- 12 12 


Sequence 


1212, Ap 




7 


167.5 


41. 


7 


2356 


3 


US- 08-7 84-582-72 


Sequence 


72, Appl 




8 


163 


40. 


5 


144 


2 


US- 08- 835-2 3 1-17 


Sequence 


17, Appl 




9 


163 


40. 


5 


144 


3 


US- 09- 108- 661- 17 


Sequence 


17, Appl 




10 


157 


39. 


1 


528 


2 


Us-08-835-231-7 


Sequence 


1, Appli 




11 


157 


39. 


1 


528 


2 


US-08-835-231-8 


Sequence 


8, Appli 




12 


157 


39. 


1 


528 


3 


US-09-108-661-7 


Sequence 


7, Appli 




13 


157 


39. 


1 


528 


3 


US-09-108-661-8 


Sequence 


8, Appli 




14 


118 


29. 


4 


78 


2 


US- 08 -82 9- 8 7 6-22 


Sequence 


22, Appl 




15 


118 


29. 


4 


78 


3 


US-09-234-874A-22 


Sequence 


22, Appl 




16 


118 


29. 


4 


78 


3 


US-09-234-873A-22 


Sequence 


22, Appl 




17 


118 


29. 


4 


78 


3 


US- 10-07 6-604-22 


Sequence 


22, Appl 




18 


102 


25. 


4 


207 


2 


US-08-193-863-3 


Sequence 


3, Appli 




19 


102 


25. 


4 


207 


2 


US-08-377-833-3 


Sequence 


3, Appli 




20 


102 


25. 


4 


207 


2 


US- 08-324-502-3 


Sequence 


3, Appli 




21 


102 


25. 


4 


207 


2 


US-08-083-501-3 


Sequence 


3, Appli 




22 


102 


25. 


4 


207 


2 


Us-08-415-939-3 


Sequence 


3, Appli 




23 


99 


24. 


6 


110 


2 


Us-07-741-931-8 


Sequence 


8, Appli 


Q 


24 


99 


24. 


6 


110 


2 


Us-07-741-931-9 


Sequence 


9, Appli 




25 




24. 


6 


X X 


2 


US-07-937-132A-8 


Sequence 


8, Appli 


Q 


26 


99 


24. 


6 


110 


2 


US-07-937-132A-9 


Sequence 


9, Appli 






Z? -J 


23. 


1 


138 

X w o 


3 


V^kJ VJJ Vi/X^ WTf X^W 


Sequence 


14 6, App 




28 


89.5 


22. 


3 


255 


3 


us- 09-2 8 0-030-51 


Sequence 


51, Appl 




29 


89 


22. 


1 


57 


2 


US-08-811-028-43 


Sequence 


43, Appl 






O Z7 


22. 


1 




9 
c. 


UO V/ /IX _70X X 


Sequence 


1, Appli 








22. 


1 


fi7 


0 
c 


VJkJ L// J ^ 1 X«J ^T\. X 


Sequence 


1, Appli 




32 


89 


22. 


1 


87 


3 


us- 09-2 80-030-59 


Sequence 


59, Appl 


c 


33 


85 


21. 


1 


73 


2 


US-08-829-876-24 


Sequence 


24, Appl 


c 


34 


85 


21. 


1 


73 


3 


US-09-234-874A-24 


Sequence 


24, Appl 


c 


35 


85 


21. 


1 


73 


3 


US-09-234-873A-24 


Sequence 


24, Appl 


c 


36 


85 


21. 


1 


73 


3 


US-10-076-604-24 


Sequence 


24, Appl 




37 


84 


20. 


9 


107 


2 


US-07-741-931-10 


Sequence 


10, Appl 




38 


84 


20. 


9 


107 


2 


US-07-937-132A-10 


Sequence 


10, Appl 


c 


39 


79 


19, 


7 


48 


2 


US-08-811-028-44 


Sequence 


44, Appl 


c 


40 


79 


19. 


7 


107 


2 


US-07-741-931-11 


Sequence 


11, Appl 


c 


41 


79 


19. 


7 


107 


2 


US-07-937-132A-11 


Sequence 


11, Appl 


c 


42 


78 


19. 


4 


53 


2 


US-08-811-028-46 


Sequence 


4 6, Appl 




43 


74.5 


18. 


5 


42954 


3 


US-09-949-016-17123 


Sequence 


17123, A 




44 


74.5 


18. 


,5 


42954 


3 


US-09-949-016-17124 


Sequence 


17124, A 




45 


71.5 


17, 


8 


711 


3 


US-09-949-016-3581 


Sequence 


3581, Ap 



ALIGNMENTS 



GenCore version 5,1.6 
Copyright (c) 1993 - 2006 Compugen Ltd, 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: January 4, 2006, 17:16:20 ; Search time 3647 Seconds 

(without alignments ) 

1215.736 Million cell updates/sec 



Title: US- 10-716-326-4 

Perfect score: 402 

Sequence: 1 MKIILWLCVFGLFLATLFPI SYLEGQAAKEFIAWLVKGRG 78 



Scoring table: 



BLOSUM62 

Xgapop 10.0 , Xgapext 0 . 5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7 . 0 

Delop 6.0 , Delext 7 . 0 



Searched: 



5883141 seqs, 28421725653 residues 



Total number of hits satisfying chosen parameters: 



11766282 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Command line parameters: 
-M0DEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool/US107 16326/ runat_03012006_143904_8645/app_query. fasta_ 

3 

-DB=GenEmbl -QFMT=fastap -SUFFIX=p2n . rge -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-USER=US10716326_@CGN_l_l_4939_erunat_03012006_143904_8645 -NCPU=6 -ICPU=3 
-NO__MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 
-DEV__TIMEOUT=120 -WARN__TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP 
-FGAPEXT=7 -YGAPOP==10 -YGAPEXT=0.5 -DEL0P=6 -DELEXT=7 



Database 



GenEmbl : * 
1 : gb_ba : * 
2 : gb_in : * 
3: gb_env:* 
gb_om: * 
gb_o V : * 
gb_pat : * 
gb_ph : * 
gb_pr : * 
gb_ro : * 
gb_sts : ^ 



4: 
5: 
6: 
7: 
8: 
9; 
10: 



11: 


gb sy:* 


12: 


gb_un : * 


13: 


gb_vi : * 


14: 


gb_htg : * 


15: 


gb_pl : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 





S CO re 


Match Length 




in 

X u 




1 
J. 


324 


80. 


6 


465 


5 


HSU77613 


U77613 Heloderma s 


O 




80. 


6 


492 


\j 


r\.r\ -J £, \j ^ £, ^ 




-3 


•J X *1 


78. 


1 


397 


c; 

•J 


kilWJ ■tJ \J \J >J \J ^ 




A 
■1 


J. *3 


78. 


1 


471 






AX894594 Seouence 


c: 

u 


X o / • ^ 


46. 


6 


458 




AyR9 4 SP'^ 


AX894593 Serjnenrp 

\J ^ ^ ^ £^ ■^J (J V»i ~ i 1 Vw- v3 




172 


42. 


8 


396 


5 


AXl 4 767 S 


AX147675 Seauence 


/ 


in ^ 

X / X • «J 


42. 


7 


1104 


Q 


Wv^WO J-iVJ 


M'^7fiRR Of^t-oHon Herr 


fi 

o 


X U u . «J 


41. 


9 


559 


A 
I 


r\L ^ £. J X o ^ 


AF52918S Ovis arie 


Q 


X o o • ^ 


41. 


9 


574 


A 
*i 


AY94 9194 


AY949194 Sn<? <zrrnf 




X O 0 • 


41. 


9 


1108 


1 








X oo . o 


41. 


9 


1123 


A 
4 


r\C J U O *i O 3 


AF'^n84'^Q Pani <5 fam 


1 9 
X^ 


X o / . o 


41. 


7 


543 


Ci 
D 


pen77'^1 4 


0^077*^14 *?emifanr'e 


X o 


X o / . o 


41. 


7 


543 


Q 
O 


RTOdfiftl 

Dl VJUDOXO 


RTODfiftl"^ Homo c;ar)i 


X 4 


X U / • <J 


41. 


7 


543 


X X 


X\X O J/ W V u o 


AY890068 Svnthetic 


X sj 


1 fi7 =^ 


41. 


7 


543 


X X 


O 7 V/ V u ? 


AY890069 Svnthetic 


X D 


X O / . .J 


41. 


7 


543 


1 1 

-1- J. 


AYRQ9S4 7 


AY899547 Svnthptic: 


1 7 


1 fi7 S 

J. O / . J 


41. 


7 


543 


X X 


AYflQ9S4 R 


AY899S48 Svnthptic 


X 0 


X D / . J 


41, 


7 


543 


1 1 

X X 




RTnn7Sn7 «?vn1-hp1-ir 


1 Q 


X D / . 0 


41. 


7 


668 


Q 


jyirl r r KU o Xi vj 


*? A A ^ M miic/^iiliie 
Zift00 4^ rl.mUoCUXUS 




X D / . J 


41. 


7 


895 


D 


AKX UO X UO 


ZiDl nQ1 QAmiAn^A 

/\r\xuoxuo oequence 


O 1 


X b / . J 


41. 


7 


955 


D 


7\Di npi n7 
AK X u o X u / 


/\KXvJoxu/ occ^uence 


o o 


i D / . O 


41. 


7 


955 


D 


AKXUo xuy 


/\r\Xvjoxu!7 oequence 




1 o / . o 


41. 


7 


1034 


Ci 
O 


A'^ 1 4 91 


A'^1491 H c; aT^ i one m 
X 4 ^ X n * o d^X did lU 


24 


167.5 


41. 


7 


1034 


6 


AR634243 


AR634243 Sequence 


25 


167.5 


41. 


7 


1036 


6 


CQ729100 


CQ729100 Sequence 


26 


167.5 


41. 


7 


1053 


9 


GPIGG 


D00014 Cavia porce 


27 


167.5 


41. 


7 


1062 


6 


AR270649 


AR270649 Sequence 


28 


167.5 


41. 


7 


1062 


8 


HUMGLUC 


J04040 Human gluca 


29 


167.5 


41. 


7 


1102 


9 


BC012975 


BC012975 Mus muscu 


30 


167.5 


41. 


7 


1116 


9 


AF276754 


AF276754 Mus muscu 


31 


167.5 


41. 


7 


1118 


9 


HAMGG 


J00059 Syrian hams 


32 


167.5 


41. 


7 


1128 


6 


CS077369 


CS077369 Sequence 


33 


167.5 


41. 


7 


1154 


8 


BC005278 


BC005278 Homo sapi 


34 


167.5 


41. 


7 


12356 


6 


AR108119 


AR108119 Sequence 


35 


167. 5 


41. 


7 


3798 


6 


CQ493428 


CQ493428 Sequence 


36 


167.5 


41. 


7 


4146 


6 


CQ414344 


CQ414344 Sequence 


37 


166.5 


41. 


4 


228 


6 


AX840858 


AX840858 Sequence 


38 


166.5 


41. 


4 


277 


9 


RATGLU4 


K02811 Rat glucago 


39 


166.5 


41. 


4 


707 


4 


AY588290 


AY588290 Capra hir 


40 


166.5 


41. 


4 


6455 


8 


HSGLUC 


V01515 Human gene 


41 


166.5 


41. 


4 


10050 


8 


HSGLUCG2 


X03991 Human gluca 


c 42 


166.5 


41. 


4 


163681 


8 


AC007750 


AC007750 Homo sapi 



GenCore version 5.1.6 
Copyright (c) 1993 - 2006 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: January 3, 2006, 23:30:36 



; Search time 38 Seconds 

(without alignments) 

197.498 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 



US-10-716-326-4 

402 

1 MKIILWLCVFGLFLATLFPI. 



. SYLEGQAAKEFIAWLVKGRG 78 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 
283416 seqs, 96216763 residues 



Total number of hits satisfying chosen parameters: 



283416 



Minimum DB seq length: 
Maximum DB seq length: 



2000000000 



Post-processing: 



Database 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

PIR 80:* 



pirl : * 
pir2: * 
pir3:* 
pir4 : * 



Pred, No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 


1 


168.5 


41. 9 


180 


1 


GCBO 


glucagon precursor 


2 


167.5 


41.7 


180 


1 


GCHY 


glucagon precursor 


3 


167.5 


41.7 


180 


1 


GCGP 


glucagon precursor 


4 


167.5 


41.7 


180 


1 


GCHU 


glucagon precursor 


5 


167.5 


41.7 


180 


1 


GCRT 


glucagon precursor 


6 


167.5 


41.7 


180 


2 


A57294 


glucagon precursor 


7 


167 


41.5 


180 


1 


GCRTDU 


glucagon precursor 


8 


164.5 


40.9 


151 


1 


GCCH 


glucagon precursor 


9 


164.5 


40.9 


206 


2 


151301 


proglucagon - chic 


10 


157.5 


39.2 


158 


1 


GCPG 


glucagon precursor 


11 


146. 5 


36. 4 


122 


1 


GCAF2 


glucagon 2 precurs 


12 


128 


31, 8 


101 


1 


GCFGB 


glucagon precursor 


13 


124 


30.8 


66 


2 


151093 


glucagon - chinook 



14 


124 


30. 


8 


178 


2 


151058 


glucagon I precurs 


15 


122 


30. 


3 


30 


2 


C61125 


glucagon-like pept 


16 


122 


30. 


3 


30 


2 


B61125 


glucagon-like pept 


17 


119 


29. 


6 


178 


2 


151057 


glucagon II precur 


18 


113 


28 . 


1 


63 


1 


GCIDC 


glucagon precursor 


19 


112 


27 . 


9 


72 


1 


GCGXA 


glucagon precursor 


20 


111 . 5 


27 . 


7 


124 


1 


GCAF 


glucagon 1 precurs 


21 


109 


27 . 


1 


60 


1 


GCONC 


glucagon precursor 


22 


107 


26. 


6 


30 


2 


S44473 


glucagon~like pept 


23 


103 . 5 


25 . 


7 


87 


1 


GCFIS 


olucaaon Drecursor 

\A ^ \Jk \^ Wl \A \^ * « ^ Vp* V4 ^ \^ ^ 


24 


102 


25. 


4 


69 


1 


GCDG69 


glucagon~69 ~ dog 


25 


96 


23 . 


9 


29 


2 


S07211 


glucagon ~ marbled 


26 


95 


23. 


6 


31 


2 


S44472 


glucagon G2 - Nort 


27 


94 


23 . 


4 


29 


1 


GCDF 


glucagon ~ smaller 


28 


93 


23. 


1 


31 


2 


S44471 


glucagon Gl ~ Nort 


29 


93 


23. 


1 


39 


1 


HWGH4G 


exendin-4 - Gila m 


30 


92 


22 . 


9 


29 


1 


GCEN 


glucagon ^ elephan 


31 


89 


22. 


1 


29 


1 


GCOPV 


glucagon - North A 


32 


89 


22 . 


1 


29 


2 


A91740 


glucagon ~ turkey 


33 


89 


22 . 


1 


29 


2 


C39258 


glucagon - common 


34 


89 


22 . 


1 


29 


2 


A91742 


glucagon ~ Arabian 


35 


89 


22 . 


1 


29 


2 


A91741 


glucagon - rabbit 


36 


87 


21 . 


6 


29 


1 


A61583 


glucagon - ostrich 


37 


87 


21 


5 


29 


1 


GCDK 


rrliipanon — duck 


38 


87 


21 . 


6 


29 


1 


GCTTS 


glucagon — slider 


39 


87 


21. 


6 


29 


2 


C60840 


glucagon I - Europ 


40 


86 


21. 


4 


29 


1 


GCCB 


glucagon - Chinchi 


41 


85 


21. 


1 


29 


1 


GCFLE 


glucagon - Europea 


42 


85 


21. 


1 


29 


2 


A61135 


glucagon - bigeye 


43 


84 


20. 


9 


39 


1 


HWGH3Z 


exendin-3 - Mexica 


44 


82 


20. 


4 


29 


2 


S39018 


glucagon - bowfin 


45 


80 


19. 


9 


144 


2 


S71426 


glucose-dependent 



ALIGNMENTS 



RESULT 1 

GCBO 

glucagon precursor - bovine 

N; Contains: glicentin-related peptide; glucagon; glucagon-like peptide 1; 
glucagon-like peptide 2 

C; Species: Bos primigenius taurus (cattle) 

C;Date: 14-Nov-1983 #sequence_revision 14-Nov-1983 #text_change 20-Mar-1998 
C;Accession: A93970; A92081; A01538 

R; Lopez, L.C.; Frazier, M.L.; Su, C.J.; Kumar, A.; Saunders, G.F. 
Proc. Natl. Acad. Sci. U.S.A. 80, 5485-5489, 1983 

A; Title: Mammalian pancreatic preproglucagon contains three glucagon-related 
peptides . 

A; Reference number: A93970; MUID: 83299996; PMID: 6577439 
A;Accession: A93970 
A;Molecule type: mRNA 
A; Residues: 1-180 <LOP> 

A; Cross-references: UNIPARC:UPI00001734FF; EMBL:K00107 
R;Bromer, W.W. ; Boucher, M.E.; Koffenberger Jr., J.E. 
J. Biol. Chem. 246, 2822-2827, 1971 
A; Title: Amino acid sequence of bovine glucagon. 



GenCore version 5.1.6 
Copyright (c) 1993 - 2006 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: January 3, 2006, 23:33:51 ; Search time 12 Seconds 

(without alignments) 

48.677 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-10-716-326-4 
402 

1 MKIILWLCVFGLFLATLFPI SYLEGQAAKEFIAWLVKGRG 78 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



57103 seqs, 7488799 residues 



Total number of hits satisfying chosen parameters: 



57103 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing; Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



Published_Applications_AA_New: * 

1 : /cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB . pep : * 

2 : /cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB . pep : * 

3: /cgn2_6/ptodata/l/pubpaa/US07_NEW_PUB.pep:* 

4 : /cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB.pep: * 

5: /cgn2_6/ptodata/l/pubpaa/US09_NEW_PUB.pep:* 

6: /cgn2_6/ptodata/l/pubpaa/US10_NEW_PUB.pep:* 

7: /cgn2_6/ptodata/l/pubpaa/USll_NEW_PUB.pep:* 

8 : /cgn2_6/ptodata/l/pubpaa/US60_NEW_PUB . pep : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


169 


42.0 


70 


6 


US-10-997-061-28 


Sequence 


28 


, Appl 


2 


161.5 


40.2 


117 


6 


US-10-997'-061-9 


Sequence 


9, 


Appli 


3 


161.5 


40.2 


122 


6 


US-10-997-061-31 


Sequence 


31 


, Appl 


4 


158 


39. 3 


277 


6 


US-10-997-061-11 


Sequence 


11 


f Appl 


5 


157.5 


39.2 


119 


6 


US-10'997-061-13 


Sequence 


13 


, Appl 


6 


157 


39.1 


31 


6 


US-10-997-061-3 


Sequence 


3, 


Appli 


7 


154 


38.3 


31 


6 


US-10-997-061-7 


Sequence 


7, 


Appli 


8 


151 


37.6 


30 


6 


US-10-997-061-1 


Sequence 


1, 


Appli 


9 


151 


37.6 


31 


6 


US-10-997-061-4 


Sequence 


4. 


Appli 



10 


151 


37 . 


6 


31 


7 


US-11-112-277-9 


Sequence 


9, Appli 


11 


151 


37. 


6 


31 


7 


US-11-112-277-23 


Sequence 


23, Appl 


12 


151 


37. 


6 


31 


7 


US-11-112-277-24 


Sequence 


24, Appl 


13 


151 


37. 


6 


34 


6 


US-10-997-061-10 


Sequence 


10, AdpI 


14 


151 


37. 


6 


34 


6 


US-10-997-061-12 


Sequence 


12, Appl 


15 


150 


37 . 


3 


31 


7 


US-11-112-277-1 


Sequence 


1, Appli 


16 


150 


37. 


3 


31 


7 


US-11-112-277-10 


Sequence 


10, Appl 


17 


148 


36. 


8 


30 


6 


US-10-997-061-5 


Sequence 


5, Appli 


18 


148 


36. 


8 


31 


6 


US-10-997-061-8 


Sequence 


8, Appli 


19 


147 


36. 


6 


30 


7 


US-11-112-277-20 


Sequence 


20, Appl 


20 


146 


36. 


3 


30 


6 


US-10-997-061-2 


Sequence 


2. Appli 


21 


145 


36. 


1 


30 


7 


US-11-112-277-11 


Sequence 


11, Appl 


22 


145 


36. 


1 


30 


7 


US-11-112-277-12 


Sequence 


12, Appl 


23 


145 


36. 


1 


30 


7 


US-11-112-277-19 


Sequence 


19, Appl 


24 


145 


36. 


1 


30 


7 


US-11-112-277-21 


Sequence 


21, Appl 


25 


145 


36. 


1 


30 


7 


US-11-112-277-22 


Sequence 


22, Appl 


26 


145 


36. 


1 


31 


7 


US-11-112-277-26 


Sequence 


26, Appl 


27 


143 


35. 


6 


29 


7 


US-11-112-277-32 


Sequence 


32, Appl 


28 


143 


35, 


6 


30 


6 


Us-10-997-061-6 . 


Sequence 


6, Appli 


29 


98 


24 . 


4 


39 


7 


US- 11- 17 4-08 9-22 5 


Sequence 


225, App 


30 


97 


24 . 


1 


39 


7 


US-11-174-089-222 


Sequence 


222, App 


31 


95 


23 . 


6 


39 


7 


US- 11- 174-08 9-227 


Sequence 


227, App 


32 


93 


23. 


1 


29 


7 


US-11-174-089-66 


Sequence 


66, Appl 


33 


93 


23 . 


1 


30 


7 


US-11-174-089-3 


Sequence 


3, Appli 


34 


93 


23 . 


1 


30 


7 


US-11-174-089-4 


Sequence 


4, Appli 


35 


93 


23 . 


1 


30 


7 


US-11-174-089-26 


Sequence 


26, Appl 


36 


93 


23 . 


1 


31 


7 


US- 11- 17 4-08 9- 63 


Sequence 


63, Appl 


37 


93 


23 . 


1 


32 


7 


US-11-174-089-61 


Sequence 


61, Appl 


38 


93 


23 . 


1 


33 


7 


US-11-174-089-59 


Sequence 


59, Appl 


39 


93 


23 . 


1 


34 


7 


US-11-174-089-57 


Sequence 


57, Appl 


40 


93 


23. 


1 


35 


7 


US-11-174-089-55 


Sequence 


55, Appl 


41 


93 


23. 


1 


36 


7 


US-11-174-089-53 


Sequence 


53, Appl 


42 


93 


23. 


1 


36 


7 


US-11-174-089-73 


Sequence 


73, Appl 


43 


93 


23. 


1 


37 


7 


US-11-174-089-51 


Sequence 


51, Appl 


44 


93 


23. 


1 


37 


7 


US-11-174-089-70 


Sequence 


70, Appl 


45 


93 


23. 


1 


37 


7 


US-11-174-089-71 


Sequence 


71, Appl 



ALIGNMENTS 



RESULT 1 

US-10-997-061-28 

Sequence 28, Application US/10997061 
Publication No. US20050260701A1 
GENERAL INFORMATION: 
APPLICANT; Wagner, Fred W. 
APPLICANT: Luan, Peng 
APPLICANT: Xia, Yuannan 
APPLICANT: Bossard, Mary 
APPLICANT: Holmquist, Barton 
APPLICANT: Merrifield, Edwin H. 
APPLICANT: Strydom, Daniel 
APPLICANT: Restoragen Inc. 

TITLE OF INVENTION: Method for Enzymatic Production of GLP-l(7-36) Amide 
Peptides 

; FILE REFERENCE: 1627.003US1 



GenCore version 5.1.6 
Copyright (c) 1993 - 2006 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: January 3, 2006, 23:23:10 ; Search time 47 Seconds 

(without alignments) 
137.206 Million cell updates/sec 



Title: US-10-716-326-4 
Perfect score: 402 

Sequence: 1 MKIILWLCVFGLFLATLFPI SYLEGQAAKEFIAWLVKGRG 78 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 572060 seqs, 82675679 residues 



Total number of hits satisfying chosen parameters: 572060 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Issued_Patents_AA: * 

1 : /cgn2_6/ptodata/l/iaa/5_COMB . pep : * 

2 : /cgn2_6/ptodata/ 1/iaa/ 6_C0MB . pep : * 

3 : /cgn2_6/ptodata/l/iaa/H_C0MB . pep : * 

4 : / cgn2_6/ptodata/ 1/iaa/ PCTUS_COMB . pep : * 

5 : /cgn2_6/ptodata/ l/iaa/RE_COMB . pep : * 

6 : /cgn2_6/ptodata/ 1/iaa/backf ilesl . pep : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


324 


80. 


6 


87 


2 


US- 


09- 


019- 


172-2 


Sequence 


2, Appli 


2 


198 


49. 


3 


64 


2 


us- 


10- 


360- 


101-228 


Sequence 


228, App 


3 


168 


41. 


8 


36 


2 


US- 


09- 


614- 


847-119 


Sequence 


119, App 


4 


168 


41. 


8. 


42 


2 


us- 


09- 


614- 


847-118 


Sequence 


118, App 


5 


167.5 


41. 


7 


180 


2 


us- 


08- 


784- 


582-56 


Sequence 


56, Appl 


6 


167.5 


41. 


7 


180 


2 


us- 


08- 


784- 


582-58 


Sequence 


58, Appl 


7 


167.5 


41. 


7 


180 


2 


us- 


08- 


784- 


582-61 


Sequence 


61, Appl 


8 


167.5 


41. 


7 


180 


2 


us- 


09- 


635- 


679E-2 


Sequence 


2, Appli 


9 


167.5 


41. 


7 


360 


2 


us- 


08- 


784- 


582-73 


Sequence 


73, Appl 


10 


163 


40. 


5 


31 


2 


us- 


09- 


209- 


799D-16 


Sequence 


16, Appl 


11 


163 


40. 


5 


31 


2 


us- 


09- 


614- 


847-123 


Sequence 


123, App 



12 


163 


40. 


5 


31 


2 


US-09-997-792A-14 


Sequence 


14, Appl 


13 


163 


40. 


5 


32 


2 


US-09- 614-84 7- 147 


Sequence 


147, Add 


14 


163 


40. 


5 


37 


1 


Us-08-095-162-19 


Sequence 


19, Appl 


15 


163 


40. 


5 


37 


1 


US- 08- 4 70-22 OA- 19 


Sequence 


19, Appl 


16 


163 


40. 


5 


37 


2 


US- 08- 9 67-37 4- 19 


Sequence 


19. AddI 


17 


163 


40. 


5 


37 


2 


US-09-302-596-1 


Seouence 


1. AddIi 


18 


163 


40. 


5 


37 


2 


US- 08-4 72-34 9-1 


Seouence 


1. Aooli 


19 


163 


40. 


5 


37 


2 


US-09- 623-61 8B-1 


Sequence 


1. AddH 


20 


163 


40 . 


5 


37 


2 


US -09- 3 3 3- 4 15-1 


Sequence 


1 . Add 1 i 


21 


163 

X nJ J 


40 . 


5 


37 


2 


US-09- 50 5- 991-19 

\J *^ \J J -J \J •J ^^X XJ 


•J ^^^^^^ 


19 AddI 


22 


163 


40 . 


5 


37 


2 


US-09-303-016-1 


G V4 Al W 


1 Anr>l i 

X, /^^^xx 


23 


163 


40 . 


5 


37 


2 


US-09- 657 -332A-1 


Sequence 


1. AddIi 


24 


163 


40 . 


5 


37 


2 


US-09- 614 -84 7 -122 


Sequence 


122, App 


25 


163 


40 . 


5 


37 


2 


Us-09-805-507-1 


Seaiience 


1 . AddI i 


26 


163 


40. 


5 


37 


2 


Us-09-876-388-1 


Sequence 


1, Appli 


27 


163 


40. 


5 


37 


2 


US- 09-8 59-804-1 


Sequence 


1 . AddIi 


28 


163 


40. 


5 


37 


2 


Us-10-287-892-1 


Sequence 


1 , App 1 i 


29 


163 


40. 


5 


37 


2 


us- 09- 9 4 3-084-1 


Sequence 


1. AddH 






40 




37 


2 


US-09-623-54 8A-343 


Secruence 


343. Add 


31 


163 


40 . 


5 


37 


2 


US- 10- 05 5-2 5 9-1 


Secfuence 

Uj \mA U W 1 1 W W 


1 AnDli 




X U -J 


4 0 


5 


37 


2 


US-09-657 -2 76-343 




34 3 Add 


33 


163 


40 . 


5 


37 


2 


US- 10-2 8 8- 34 0-1 


Seauence 


1- AddIi 


34 


163 


40 . 


5 


37 


2 


US-09-982-978-1 


Sequence 


1, Appli 


35 


160 


39 . 


8 


31 


2 


US-09-258-750-14 


Sequence 


14, Appl 


36 


1 60 


39 . 


8 


31 


2 


US- 09-2 5 8 -7 50- 15 


Secrupnce 

W \A M X i ^ 


15 AddI 


37 


160 


39 . 


8 


31 


2 


US-09-258-750-16 


Sequence 


16, Appl 


38 


160 


39 . 


8 


31 


2 


US-09-209-799D-28 


Sequence 


28, Appl 


39 


160 


39 . 


8 


31 


2 


US-09-398-111-14 


Sequence 


14, Appl 


40 


160 


39. 


8 


31 


2 


US-09-398-111-15 


Sequence 


15, Appl 


41 


160 


39. 


8 


31 


2 


US-09-398-111-16 


Sequence 


16, Appl 


42 


160 


39. 


8 


31 


2 


US-09-997-792A-25 


Sequence 


25, Appl 


43 


160 


39. 


8 


32 


2 


US-09-258-750-95 


Sequence 


95, Appl 


44 


160 


39. 


8 


32 


2 


US-09-398-111-95 


Sequence 


95, Appl 


45 


160 


39. 


8 


33 


2 


US-09-258-750-22 


Sequence 


22, Appl 



ALIGNMENTS 



RESULT 1 
US-09-019-172-2 

; Sequence 2, Application US/09019172A 

; Patent No. 6723530 

; GENERAL INFORMATION: 

; APPLICANT: Drucker, Daniel J. 

; TITLE OF INVENTION: POLYNUCLEOTIDES ENCODING PROEXENDIN, AND METHODS AND 
; TITLE OF INVENTION: USES THEREOF 
; FILE REFERENCE: 8607-013 

; CURRENT APPLICATION NUMBER: US/09/019, 172A 

; CURRENT FILING DATE: 1998-02-05 

; NUMBER OF SEQ ID NOS : 4 

; SOFTWARE: Patentin Ver. 2.0 

; SEQ ID NO 2 

LENGTH: 87 
; TYPE: PRT 

ORGANISM: LIZARD 
US-09-019-172-2 



GenCore version 5.1.6 
Copyright (c) 1993 - 2006 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



January 4, 2006, 21:56:58 ; Search time 3041 Seconds 

(without alignments) 

1200.063 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-10-716-326-4 
402 

1 MKIILWLCVFGLFLATLFPI 



SYLEGQAAKEFIAWLVKGRG 78 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



41078325 seqs, 23393541228 residues 



Total number of hits satisfying chosen parameters: 



82156650 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-M0DEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool/US10716326/runat_03012006_143904_8655/app_query. fasta^l 
3 

-DB=EST -QFMT=fastap -SUFFIX=p2n. rst -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 

-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR__MIN=0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-USER=US10716326_@CGN_l_l_5315__@runat_03012006_143904_8655 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 
-DEV_TIMEOUT=12 0 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP= 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0.5 -DEL0P=6 -DELEXT=7 

Database : EST:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 



10: 



gb_estl : * 
gb_est2:* 
gb_est3 : * 
gb_htc: * 
gb_est4 : * 
gb_est5 : * 
gb_est6: * 
gb_est7 : * 
gb_gssl : * 
gb_gss2 : 



11: gb_gss3:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



No . 


S core 


Match 






X 


Description 


I 


178 


44. 


3 


557 


2 


BG6S621 8 


BG656218 


ib38e07 .y 


2 


174.5 


43. 


4 


400 


5 


rRn68 QQ9 


CB068992 


is33f09.y 


3 


174.5 


43. 


4 


61 1 




PAft 4 7 Qfl 1 


CA847981 


iq41a09.y 


A 

M 


X f J 9 ^ 


43. 


2 




•J 


RM*?! S0Q6 


BM315096 


ig43d08.y 


•J 


179 S 


42. 


9 


^ J >J 


•J 
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GenCore version 5.1.6 
Copyright (c) 1993 - 2006 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: January 3, 2006, 23:30:21 ; Search time 163 Seconds 

(without alignments) 

199.943 Million cell updates/sec 

Title: US-10-716-326-4 

Perfect score: 402 

Sequence: 1 MKIILWLCVFGLFLATLFPI SYLEGQAAKEFIAWLVKGRG 78 



Scoring table: 
Searched: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 

1867569 seqs, 417829326 residues 



Total number of hits satisfying chosen parameters: 1867569 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_AA_Main: * 

1: /cgn2_6/ptodata/l/pubpaa/US07__PUBCOMB.pep: * 

2 : /cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB.pep: * 

3: /cgn2_6/ptodata/l/pubpaa/US09_PUBCOMB.pep: * 

4 : /cgn2_6/ptodata/l/pubpaa/US10A_PUBCOMB . pep : * 

5 : /cgn2_6/ptodata/l/pubpaa/US10B_PUBCOMB . pep : * 

6 : / cgn2_6/ptodata/ 1/pubpaa/USl 1_PUBC0MB . pep : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No.-. Score Match Length DB ID Description 



1 
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US-10-215-272-4 


Sequence 


4, Appli 


2 
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78 


4 


US-10-716-326-4 


Sequence 
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78 
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ALIGNMENTS 



RESULT 1 
US-10-215-272-4 

Sequence 4, Application US/10215272 
Publication No. US20040002468A1 
GENERAL INFORMATION: 
APPLICANT: Genzyme Corporation 
APPLICANT: Wadsworth, Samuel C. 
APPLICANT: Armentano, Donna 
APPLICANT: Gregory, Richard J. 
APPLICANT: Parsons, Geoffrey 

TITLE OF INVENTION: Methods of Treating Diabetes and Other 
TITLE OF INVENTION: Blood Sugar Disorders 
FILE REFERENCE: 2478.2019002 PCT 
CURRENT APPLICATION NUMBER: US/10/215,272 
CURRENT FILING DATE: 2002-08-07 
PRIOR APPLICATION NUMBER: US 60/310,982 
PRIOR FILING DATE: 2001-08-08 
NUMBER OF SEQ ID NOS : 54 



SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 4 
LENGTH: 78 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: Amino acid sequence of Exendin-4 . GLP-lGly8 
US-10-215-272-4 

Query Match 100.0%; Score 402; DB 4; Length 78; 

Best Local Similarity 100.0%; Pred. No. 2.5e-39; 

Matches 78; Conservative 0; Mismatches 0; Indels 0; Gaps C 

Qy 1 MKIILWLCVFGLFLATLFPISWQMPVESGLSSEDSASSESFAKRIKRHGEGTFTSDVSSY 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MKIILWLCVFGLFLATLFPISWQMPVESGLSSEDSASSESFAKRIKRHGEGTFTSDVSSY 60 

Qy 61 LEGQAAKEFIAWLVKGRG 7 8 

I I I I I I I I I I I I I I I I I I 
Db' 61 LEGQAAKEFIAWLVKGRG 78 



RESULT 2 
US-10-716-326-4 

Sequence A, Application US/10716326 
Publication No. US20040143104A1 
GENERAL INFORMATION: 
APPLICANT: Genzyme Corporation 
APPLICANT: Wadsworth, Samuel 
APPLICANT: Armentano, Donna 
APPLICANT: Gregory, Richard J. 
APPLICANT: Parsons, Geoffrey 

TITLE OF INVENTION: Methods of Treating Diabetes and Other Blood Sugar 
Disorders 

FILE REFERENCE: 5062CIP 

CURRENT APPLICATION NUMBER: US/10/716,326 
CURRENT FILING DATE: 2003-11-17 
PRIOR APPLICATION NUMBER: US 10/215,272 
PRIOR FILING DATE: 2002-08-07 
PRIOR APPLICATION NUMBER: US 60/310,982 
PRIOR FILING DATE: 2001-08-08 
NUMBER OF SEQ ID NOS: 54 
SOFTWARE: Pa tent In version 3.2 
SEQ ID NO 4 
LENGTH: 78 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER". INFORMATION: Amino acid sequence of Exendin-4 . GLP-lGly8 
US-10-716-326-4 

Query Match 100.0%; Score 402; DB 4; Length 78; 

Best Local Similarity 100.0%; Pred. No. 2.5e-39; 

Matches 78; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 



1 MKIILWLCVFGLFLATLFPISWQMPVESGLSSEDSASSESFAKRIKRHGEGTFTSDVSSY 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 



1 MKIILWLCVFGLFLATLFPISWQMPVESGLSSEDSASSESFAKRIKRHGEGTFTSDVSSY 60 



Qy 61 LEGQAAKEFIAWLVKGRG 78 

I I I I I I I I I I I I I I I I I I 
Db 61 LEGQAAKEFIAWLVKGRG 78 



•RESULT 3 
US-10-715-976-4 

Sequence 4, Application US/10715976 
Publication No. US20050107318A1 

GENERAL INFORMATION: 
APPLICANT: Genzyme Corporation 
APPLICANT: Wadsworth, Samuel 
APPLICANT: Arraentano, Donna 
APPLICANT: Gregory, Richard J. 
APPLICANT: Parsons, Geoffrey 

TITLE OF INVENTION: Methods of Treating Diabetes and Other Blood Sugar 
Disorders 

FILE REFERENCE: 5121 

CURRENT APPLICATION NUMBER: US/ 10/7 15 , 97 6 
CURRENT FILING DATE: 2003-11-17 
NUMBER OF SEQ ID NOS: 54 
SOFTWARE: PatentIn version 3.2 
SEQ ID NO 4 
LENGTH: 78 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: Amino acid sequence of Exendin-4 . GLP-lGly8 
US-10-715-976-4 

Query Match 100.0%; Score 402; DB 5; Length 78; 

Best Local Similarity 100.0%; Pred. No. 2.5e-39; 

Matches 78; Conservative 0; Mismatches 0; Indels 0; Gaps C 

Qy 1 MKIILWLCVFGLFLATLFPISWQMPVESGLSSEDSASSESFAKRIKRHGEGTFTSDVSSY 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MKIILWLCVFGLFLATLFPISWQMPVESGLSSEDSASSESFAKRIKRHGEGTFTSDVSSY 60 



Qy 61 LEGQAAKEFIAWLVKGRG 78 

I I I I I I I I I I I I I I I I I I 
Db 61 LEGQAAKEFIAWLVKGRG 78 



RESULT 4 

US-10-215-272-20 

Sequence 20, Application US/10215272 
Publication No. US20040002468A1 
GENERAL INFORMATION: 
APPLICANT: Genzyme Corporation 
APPLICANT: Wadsworth, Samuel C. 
APPLICANT: Armentano, Donna 
APPLICANT: Gregory, Richard J. 
APPLICANT: Parsons, Geoffrey 

TITLE OF INVENTION: Methods of Treating Diabetes and Other 
TITLE OF INVENTION: Blood Sugar Disorders 



